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CDNA FOR HUMAN METHYLENETETRAHYDROFQIATE REDUCTASE 

BACKGROUND OF THE INVENTION 

(a) Field of the Invention 

5 The invention relates to a cDNA probe for human 

methylenetetrahydrof olate reductase (MTHFR), and its 
uses . 

(b) Description of Prior Art 

Folic acid derivatives are coenzymes for 

10 several critical single-carbon transfer reactions, 
including reactions in the biosynthesis of purines, 
thymidylate and methionine . Methylenetetrahydrof olate 
reductase (MTHFR; EC 1.5,1,20) catalyzes the NADPH- 
linked reduction of 5 , 10-methylenetetrahydrof olate to 

15 5-methyltetrahydrof olate, a co-substrate for methyla- 
tion of homocysteine to methionine. The porcine liver 
enzyme, a f lavoprotein , has been purified to homogene- 
ity; it is a homodimer of 77-kDa subunits - Partial 
proteolysis of the porcine peptide has revealed two 

20 spatially distinct domains: an N-terminal domain of 40 
kDa and a C-terminal domain of 37 kDa. The latter 
domain contains the binding site for the allosteric 
regulator S-adenosylmethionine . 

Hereditary deficiency of MTHFR, an autosomal 

25 recessive disorder, is the most common inborn error of 
folic acid metabolism. A block in the production of 
methyltetrahydrof olate leads to elevated homocysteine 
with low to normal levels of methionine. Patients 
with severe deficiencies of MTHFR (0 -20% activity in 

3 0. fibroblasts) can have variable phenotypes. Developmen- 
tal delay, mental retardation, motor and gait abnor- 
malities, peripheral neuropathy, seizures and psychi- 
atric disturbances have been reported in this group, 
although at least one patient with severe MTHFR defi- 

3 5 ciency was asymptomatic. Pathologic changes in the 
severe form include the vascular changes that have 
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been found in other conditions with elevated homocys- 
teine, as well as reduced neurotransmitter and 
methionine levels in the CNS. A milder deficiency of 
MTHFR (35-50% activity) has been described in patients 
5 with coronary artery disease (see below),. Genetic 
heterogeneity is likely, considering the diverse 
clinical features, the variable levels of enzyme 
activity, and the differential heat inactivation pro- 
files of the reductase in patients' cells. 

10 Coronary artery disease (CAD) accounts for 25% 

of deaths of Canadians. Cardiovascular risk factors 
(male sex, family history, smoking, hypertension, 
dyslipoproteinemia and diabetes) account for approxi- 
mately 60 to 70% of our ability to discriminate CAD 

15 patients from healthy subjects. Elevated plasma homo- 
cysteine has also been shown to be an independent risk 
factor for cardiovascular disease. 

Homocysteine is a sulf hydryl-containing amino 
acid that is formed by the demethylation of 

20 methionine. It is normally metabolized to cysteine 
( transsulf uration ) or re-methylated to methionine. 
Inborn errors of metabolism (as in severe MTHFR defi- 
ciency) causing extreme elevations of homocysteine in 
plasma, with homocystinuria, are associated with pre- 

25 mature vascular disease and widespread arterial and 
venous thrombotic phenomena. Milder elevations of 
plasma homocysteine (as in mild MTHFR deficiency) have 
been associated with the development of peripheral 
vascular disease, cerebrovascular disease and 

30 premature CAD. 

Homocysteine remethylation to methionine 
requires the folic acid intermediate, 5-methyltetrahy- 
drofolate, which is produced from 5 , 10-methylenetet- 
rahydrof olate folate through the action of 5,10- 

35 methylenetetrahydrof olate reductase (MTHFR). Defi- 
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ciency of MTHFR results in an inability to metabolize 
homocysteine to methionine; elevated plasma homocys- 
teine and decreased methionine are the metabolic con- 
sequences of the block. Severe deficiencies of MTHFR 
5 (less than 20% of activity of controls) as described 
above, are associated with early-onset neurologic 
symptoms (mental retardation, peripheral neuropathy, 
seizures, etc.) and with atherosclerotic changes and 
thromboembolism. Milder deficiencies of MTHFR (35-50% 

10 of activity of controls), with a thermolabile form of 
the enzyme, are seen in patients with cardiovascular 
disease without obvious neurologic abnormalities. 

In a survey of 212 patients with proven coro- 
nary artery disease, the thermolabile form of MTHFR 

15 was found in 17% of the CAD group and 5% of controls. 
In a subsequent report on 339 subjects who underwent 
coronary angiography, a correlation was found between 
thermolabile MTHFR and the degree of coronary artery 
stenosis. Again, traditional risk factors (age, sex, 

20 smoking, hypertension, etc.) were not significantly 
associated with thermolabile MTHFR. All the studies on 
MTHFR were performed by enzymatic assays of MTHFR in 
lymphocytes, with measurements of activity before and 
after heat treatment to determine thermolability of 

25 the enzyme. 

Since 5-methyltetrahydrof olate, the product 
of the MTHFR reaction, is the primary form of circula- 
tory folate, a deficiency in MTHFR might lead to other 
types of disorders- For example, periconceptual folate 

30 administration to women reduces the occurrence and 
recurrence of neural tube defects in their offspring. 
Neural tube defects are a group of developmental mal- 
formations ( meningomyelocele , anencephaly , encephalo- 
cele) that arise due to failure of closure of the neu- 

35 ral tube. Elevated levels of plasma homocysteine have 
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been reported in mothers of children with neural tube 
defects. The elevated plasma homocysteine could be due 
to a deficiency of MTHFR, as described above for car- 
diovascular disease. 
5 Neuroblastomas are tumors derived from neural 

crest cells. Many of these tumors have been reported 
to have deletions of human chromosome region lp3 6, the 
region of the genome to which MTHFR has been mapped. 
It is possible that MTHFR deletions/mutations are 

10 responsible for or contribute to the formation of this 
type of tumor. MTHFR abnormalities may also contribu- 
tion to the formation of other types of tumors, such 
as colorectal tumors, since high dietary folate has 
been shown to be inversely associated with risk of 

15 colorectal adenomas . 

MTHFR activity is required for homocysteine 
methylation to methionine. Methionine is necessary for 
the formation of S-adenosylmethionine, the primary 
methyl donor for methylation of DNA, proteins, lipids, 

2 0 neurotransmitters, etc. Abnormalities in MTHFR might 
lead to lower levels of methionine and S-adenosylme- 
thionine, as well as to elevated homocysteine. Disrup- 
tion of methylation processes could result in a wide 
variety of conditions, such as neoplasias, developmen- 

25 tal anomalies, neurologic disorders, etc. 

Although the MTHFR gene in Escherichia coli 
(metF) has been isolated and sequenced, molecular 
studies of the enzyme in higher organisms have been 
limited without the availability of a eukaryotic cDNA. 

30 In this communication, we report the isolation of a 
human cDNA for MTHFR, its chromosomal assignment, and 
the identification of mutations in MTHFR-def icient 
patients. This report represents the first molecular 
description of mutations in MTHFR deficiency. 
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It would be highly desirable to be provided 
with a cDNA probe for human methylenetetrahydrof olate 
reductase (MTHFR). This probe would be used for iden- 
tification of sequence abnormalities in individuals 
5 with severe or mild MTHFR deficiency, including car- 
diovascular patients and patients with neurologic 
symptoms or tumors. The probe would also be used in 
gene therapy, isolation of the gene, and expression 
studies to produce the MTHFR protein. The probe would 
10 also provide the amino acid sequence of the human 
MTHFR protein, which would be useful for therapy of 
MTHFR deficiency by biochemical or pharmacological 
approaches - 

It would be highly desirable to be provided 
15 with a molecular description of mutations in methyl- 
enetetrahydrof olate reductase deficiency- 

SUMMARY OF THE INVENTION 

One aim of the present invention is to provide 
20 a cDNA probe for human methylenetetrahydrof olate 
reductase (MTHFR). 

Another aim of the present invention is to pro- 
vide a molecular description of mutations in methyl- 
enetetrahydrof olate reductase deficiency - 

2 5 Another aim of the present invention is to pro- 

vide a nucleic acid and amino acid sequence for human 
methylenetetrahydrof olate reductase . 

Another aim of the present invention is to pro- 
vide potential therapy for individuals with methyl- 

3 0 enetetrahydrof olate reductase deficiency. 

Another aim of the present invention is to pro- 
vide a system for synthesis of MTHFR protein in vitro. 

A further aim of the present invention is to 
provide for a technology/protocol for identification 
3 5 of sequence changes in the MTHFR gene- 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figs- lA to IE illustrate the first cDNA coding 
sequence for methylenetetrahydrof olate reductase 
5 ( MTHFR ) ; 

Fig. 2 is the alignment of amino acids for 
human methylenetetrahydrof olate reductase (MTHFR), the 
metF genes from E. coli (ECOMETF), and S. typhlmurium 
(STYMETF), and an unidentified open reading frame in 
10 Saccharomyces cerevdsiae that is divergently tran- 
scribed from an excision repair gene (ysRADl); 

Figs. 3A and 3B illustrate the sequencing and 
restriction enzyme analysis for the Arg to Ter substi- 
tution; 

15 Figs. 4A and 4B illustrate the sequencing and 

restriction enzyme analysis for the Arg to Gin substi- 
tution ; 

Figs. 5A and 5B illustrates the sequence change 
and restriction enzyme analysis for the alanine to 
20 valine substitution; 

Fig- 6 illustrates the total available sequence 
of human MTHFR cDNA; 

Figs. 7A and 73 illustrate the expression 
analysis of MTHFR cDNA in E. colir respectively (7A) 
25 the Western blot of bacterial extracts and tissues, 
and (7B) the thermolability assay of bacterial 
extracts ; 

Figs. 8A to 8D illustrate the identification of 
a 5' splice site mutation leading to a 57 bp in-frame 
30 deletion of the cDNA; 

Figs. 9A to 9D illustrate the diagnostic 
restriction endonuclease analysis of 4 mutations; 

Figs. IDA to lOD illustrate the ASO hybridiza- 
tion analysis of 2 mutations; and 
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Fig. 11 illustrates the region of homology 
between human methylenetetrahydrof olate reductase 
(MTHFR) and human dihydrof olate reductase (DHFR). 

5 DETAILED DESCRIPTION OF THE INVENTION 

Sequencing of peptides from porcine MTHFR 

Homogeneous native porcine MTHFR was digested 
with trypsin to generate a 4 0 kOa N-terminal fragment 

10 and a 31 kDa C-terminal fragment; the 31 kDa fragment 
is a proteolytic product of the 3 7 kDa fragment. The 
fragments were separated by SDS-PAGE, electroeluted, 
and the denatured fragments were digested with lysyl 
endopeptidase (LysO- The resulting peptides were 

15 separated by reversed-phase HPLC and subjected to 
sequence analysis by Edman degradation (details con- 
tained in Goyette P et al . , Nature Genetics, 1994, 
7 :195-200) . 

2 0 Isolation and sequencing of cDNAs 

Two degenerate oligonucleotides were synthe- 
sized based on the sequence of a 30 amino acid porcine 
MTHFR peptide (first underlined peptide in Fig. 2). 
These were used to generate a 90 bp PGR product, 
25 encoding the predicted peptide, from reverse tran- 
scription-PCR reactions of 500 ng pig liver polyA+ 
RNA, A pig-specific (non-degenerate, antisense) PGR 
primer was then synthesized from this short cDNA 
sequence. Using this primer and a primer for phage 

3 0 arms, a human liver T^gtlO cDNA library (Glontech) was 

screened by PGR; this technique involved the genera- 
tion of phage lysate stocks (50,000 pfu) which were 
boiled for 5 mins and then used directly in PGR reac- 
tions with these two primers. PGR fragments were 
3 5 then sequenced directly (Gycle Sequencing™ kit, 
GIBGO), and a positive clone was identified by com- 
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parison of the deduced amino acid sequence to the 
sequence of the pig peptide (allowing for inter-spe- 
cies variations). The positive stock was then 
replated at lower density and screened with the radio- 
5 labelled positive PGR product by plaque hybridization 
until a well-isolated plaque was identified- Phage 
DNA was purified and the insert was then subcloned 
into pBS+ (Bluescript) and sequenced on both strands 
(Cycle Sequencing™ kit, GIBCO and Sequenase™, Pharma- 

10 cia). The deduced amino acid sequence of the human 
cDNA was aligned to the porcine peptide sequences, the 
metF genes from E.cold {ecometf , accession number 
VO1502) and S. typhimurium { stymetf , accession number 
X07 689) and with a previously unidentified open read- 

15 ing frame in Saccharomyces cerevisiae that is diver- 
gently transcribed with respect to the excision repair 
gene, ysRADl (accession number KO2070). The initial 
alignments were performed using BestFit™ in the GCG 
computer package, and these alignments were adjusted 

20 manually to maximize homologies - 

In summary, degenerate oligonucleotide primers 
were designed to amplify a sequence corresponding to a 
3 0-amino acid segment of a porcine peptide from the N- 
terminal region of the enzyme (first porcine peptide 

25 in Fig. 2). A 90-bp porcine cDNA fragment was 
obtained from reverse transcription/PCR of pig liver 
RNA. Sequencing of the PGR fragment confirmed its 
identity by comparison of the deduced amino acid 
sequence to the porcine peptide sequence. A nondegen- 

3a erate oligonucleotide primer, based on the internal 
sequence of the porcine cDNA, was used in conjunction 
with primers for the phage arms to screen a human 
liver A^gtlO cDNA library by PGR. The insert of the 
positive clone was isolated and sequenced. The 
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sequence consisted of 1266 bp with one continuous open 
reading frame. 

Homology with MTHPR in other species 

5 The deduced amino acid sequence of the human 

cDNA was aligned with the metF genes from E.coli and 
S.typhimurium, as well as with a previously unidenti- 
fied ORF in Saccharomyces cerevisiae that is diver- 
gently transcribed with respect to the excision repair 

10 gene. ysRADl (Fig. 2). The sequences homologous to 5 
porcine peptides are underlined in Fig. 2. Three seg- 
ments (residues 61-94, 219-240, and 337-351) corre- 
spond to internal peptide sequence from the N-terminal 
40 kDa domain of the porcine liver enzyme- Residues 

15 374-393 correspond to the upstream portion of the LysC 
peptide from the C-terminal domain of the porcine 
liver enzyme that is labeled when the enzyme is irra- 
diated with UV light in the presence of (^h- 
methyl )AdoMet ; as predicted from the AdoMet labeling 

20 studies, this peptide lies at one end (N-terminal) of 
the 37 kDa domain. A fifth region of homology 
(residues 359-372) was also identified, but the local- 
ization of the porcine peptide within the native pro- 
tein had not been previously determined. 

25 Methylenetetrahydrofolate reductase (MTHFR) is 

an enzyme involved in amino acid metabolism, that is 
critical for maintaining an adequate methionine pool, 
as well as for ensuring that the homocysteine concen- 
tration does not reach toxic levels. The high degree 

30 of sequence conservation, from E.coli to Homo sapiens, 
attests to the significance of MTHFR in these species. 
The enzyme in E.coli (encoded by the metF locus) is a 
33 kDa peptide that binds reduced FAD and catalyzes 
the reduction of methylenetetrahydrofolate to 
3 5 methyltetrahydrofolate. The metF enzyme differs from 
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the mammalian enzyme in that it cannot be reduced by 
NADPH or NADH, and its activity is not allosterically 
regulated by S-adenosylmethionine. The native porcine 
enzyme is susceptible to tryptic cleavage between the 
5 N-terminal 40 kDa domain and the C-terminal 37 kDa 
domain, and this cleavage results in the loss of 
allosteric regulation by adenosylmethionine , but does 
not result in loss of catalytic activity. Since the 
homology between the bacterial and mammalian enzymes 

10 is within the N-terminal domain, this region must con- 
tain the flavin binding site and residues necessary to 
bind the folate substrate and catalyze its reduction. 
The domain structure of the human enzyme has not been 
elucidated, although the human enzyme has been 

15 reported to have a molecular mass of 150 kDa and is 
likely to be a homodimer of 77 kDa. 

We predict that the point of cleavage between 
the two domains lies between residues 3 51 and 374 of 
the human sequence, based on the localization of pep- 

20 tides obtained from the isolated domains of the por- 
cine enzyme- This region, containing the highly- 
charged sequence KRREED, is predicted to have the 
highest hydrophilicity ind surface probability of any 
region in the deduced human sequence. 

25 The N-terminus of the porcine protein has been 

sequenced, and the region encoding this part of the 
protein is missing from the human cDNA. We estimate 
that this cDNA is missing only a few residues at the 
N-terminus, since the predicted molecular mass of the 

30 deduced sequence upstream of the putative cleavage 
site (KRREED) is 40 kDa, and the measured molecular 
mass of the porcine N-terminal domain is also 40 kDa. 
When the bacterial, yeast and human sequences are 
aligned, the deduced human sequence contains an N-ter- 

35 minal extension of 40 amino acids; we suspect that 
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this extension contains determinants for NADPH bind- 
ing- Many pyridine nucleotide-dependent oxidoreduc- 
tases contain such determinants at the N-terminus of 
the protein. 

5 The C-terminus of the human sequence contains a 

peptide that is labeled when the protein is irradiated 
with ultraviolet light in the presence of tritiated 
AdoMet. The cDNA sequence we report here contains 
only about 7 kDa of the predicted 37 kDa mass of this 

10 domain, indicating that this cDNA is truncated at the 
3* terminus as well. A number of peptides from the C- 
terminal porcine domain have also not been detected. 
As might be expected, given that the prokaryotic 
enzymes do not appear to be allosterically regulated 

15 by AdoMet, there are no significant homologies between 
the C-terminal region in this cDNA and the prokaryotic 
metF sequences. The alignment shown in Fig. 2 shows 
that the homologous sequences terminate just prior to 
the putative cleavage site of the human enzyme. 

20 

Chromosomal assignment 

In situ hybridization to metaphase human chrc 
mosomes was used for localization of the human gene. 
The analysis of the distribution of 200 silver grains 
25 revealed a significant clustering of grains 40 grains, 
in the p36.3-36*2 region of chromosome 1 (p<0.0001), 
with the majority of grains, 25 grains, observed over 
lp36.3. 

The isolation of the human cDNA has allowed us 
30- to localize the gene to chromosome lp36.3. The obser- 
vation of one strong signal on that chromosome with 
little background is highly suggestive of a single 
locus with no pseudogenes . Southern blotting of human 
DNA revealed fragments of approximately 10 kb, pre- 
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dieting a gene of average size, since this cDNA 
encodes approximately half of the coding sequence. 

Additional cDNA sequences and constructs for expres- 
5 sion analysis 

A human colon carcinoma cDNA library (gift of 

Dr. Nicole Beauchemin, McGill University) was screened 

by plaque hybridization with the original 1.3 kb cDNA 

to obtain additional coding sequences. A cDNA of 2.2 

10 kb was isolated, which contained 1.3 kb of overlapping 
sequence to the original cDNA and 900 additional bp at 
the 3' end (Fig. 6). The amino acid sequence is iden- 
tical to that of the original cDNA for the overlapping 
segment (codons 1-415) except for codon 177 (ASP) 

15 which was a GLY codon in the original cDNA. Analysis 
of 50 control chromosomes revealed an ASP codon at 
this position. The cDNA has an open reading frame of 
1980 bp, 100 bp of 3 • UTR and a poly A tail. 

Sequencing was performed on both strands for 

20 the entire cDNA. Additional 5* sequences (800 bp) 
were obtained from a human kidney cDNA library 
(Clontech) but these sequences did not contain addi- 
tional coding sequences and were therefore used for 
the PCR-based mutagenesis only (as described below) 

25 and not for the expression analysis. The two cDNAs 
(2.2 kb and 800 bp) were ligated using the EcoRl site 
at bp 199 and inserted into the Bluescript™ vector 
( Stratagene) . The 2.2 kb cDNA was subcloned into the 
expression vector pTrc99A (Pharmacia) using the Ncol 

30 site at bp 11 and the Xbal site in the polylinker 
region of both the Bluescript™ and the pTrc99A vec- 
tors. Sequencing was performed across the cloning 
sites to verify the wild-type construct. 
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UTILITY OF INVENTION IN IDENTIFICATION OF MUTATIONS 

I. Identification of first two mutations in severe 
MTHFR deficiency 

Total RNA of skin fibroblasts from MTHFR-defi- 
5 cient patients was reverse-transcribed and amplified 
by PGR for analysis by the single strand conformation 
polymorphism (SSCP) method (Orita, M. et al.. Genom- 
ics, 1989, 5.: 8874-8879 ) , Primers were designed to 
generate fragments of 250-300 bp and to cover the 

10 available cDNA sequences with small regions of overlap 
for each fragment at both ends. The first mutation 
identified by SSCP was a C to T substitution at bp 559 
in patient 1554; this substitution converted an 
arginine codon to a termination codon (Fig. 3A) . 

15 Since the mutation abolished a Fokl site, restriction 
digestion was used for confirmation of the change and 
for screening additional patients for this mutation; a 
second patient (1627) was identified in this manner 
(Fig. 3B). The SSCP pattern for patient 1554 and the 

20 restriction digestion pattern for both patients was 
consistent with a homozygous mutant state or with a 
genetic compound consisting of the nonsense mutation 
with a second mutation that did not produce any 
detectable RNA (null allele). Studies in the parents 

25 are required for confirmation. 

The second substitution (Fig, 4A) was a G to A 
transition at bp 482 in patient 1834 that converted an 
arginine into a glutamine residue- The substitution 
created a Pstl site which was used to verify the sub- 

30 stitution and to identify a second patient (1863) with 
this change (Fig. 4B ) . The SSCP analysis and the 
restriction digestion pattern were consistent with a 
heterozygous state for both patients- The arginine 
codon affected by this change is an evolutionarily- 

35 conserved residue, as shown in Fig. 2. This observa- 
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tion, in conjunction with the fact that the codon 
change is not conservative, makes a strong argument 
that the substitution is a pathologic change rather 
than a benign polymorphism. Furthermore, 35 controls 
5 (of similar ethnic background to that of the probands) 
were tested for this substitution by Southern blotting 
of Pstl-digested DNA; all were negative. 

The family of patient 1834 was studied- The 
symptomatic brother and the mother of the proband were 
10 all shown to carry this substitution, whereas the 
father was negative for the change (Fig. 4B). In the 
family of 1863, the mother of the proband was shown to 
be a carrier, while the father and an unaffected 
brother were negative. 

15 

Cell lines 

Cell line 1554 is from a Hopi male who was 
admitted at age three months with homocystinuria, sei- 
zures, dehydration, corneal clouding, hypotonia and 

20 Candida sepsis. Folate distribution in cultured 

fibroblasts showed a Pedlococcus 

ceri^risiae/ Lactobacillus casei (PC/LC) ratio of 0.52 
(Control 0.14). There was no measurable methyl- 
enetetrahydrof olate reductase (MTHFR) activity 

25 (Control values = 9.7 and 15.1 nmoles/h/mg protein; 
residual activity after treatment of control extracts 
at 55^C for 20 min. = 28% and 31%). 

Cell line 1627 is from a Choctaw male who pre-- 
sented with poor feeding, apnea, failure to thrive, 

30 dehydration and homocystinuria at five weeks of age. 
He was subsequently found to have superior sagittal 
sinus thrombosis and hydrocephalus. The PC/LC ratio 
was 0.61 and the specific activity of MTHFR was 0.1 
nmoles/h/mg protein. There is consanguinity in that 
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the maternal and paternal grandmothers are thought to 
be "distantly related". 

Cell line 1779 is from a French Canadian male 
with homocystinuria who first had limb weakness, inco- 
5 ordination, paresthesiae , and memory lapses at age 15 
years, and was wheelchair-bound in his early twenties - 
His brother (cell line 1834) also has homocystinuria, 
but is 37 years old and asymptomatic. Specific activ- 
ity of MTHFR was 0.7 and 0.9 nmole/h/mg protein for 

10 1779 and 1834, respectively; the residual activity 
after heat treatment at 55°C was 0.9% and 0% for 1779 
and 1834, respectively. 

Cell line 1863 is from a white male who was 
diagnosed at age 21 years because of a progressive 

15 gait disturbance, spasticity, cerebral white matter 
degeneration, and homocystinuria. He had a brother 
who died at age 21 years of neurodegenerative disease - 
Specific activity of MTHFR in fibroblast extracts was 
1.76 nmoles/h/mg protein and the residual enzyme ac- 

20 tivity after treatment at 55°C was 3.6%. 

Mutation analysis 

Primers were designed from the cDNA sequence to 
generate 250-300 bp fragments which overlapped 50- 

25 75 bp at each end- The primer pairs were used in 
reverse transcription-PCR of 5[xg patient total fibro- 
blast RNA. The PCR products were analyzed by a non- 
isotopic rapid SSCP protocol ( PhastSystem™, Pharma- 
cia), which uses direct silver staining for detection 

30 of single strands. Any PCR products from patients 
showing a shift on SSCP gels were purified by NuSieve 
(FMC Bioproducts) and sequenced directly (Cycle 
Sequencing™ kit, GIBCO) to identify the change. If 
the change affected a restriction site, then a PCR 

35 product was digested with the appropriate restriction 
endonuclease and analyzed on polyacrylamide gels. To 
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screen for the Arg to Gin mutation in controls, 5 (ig 
of Pstl-digested DNA was run on 0.8% agarose gels and 
analyzed by Southern blotting using the radiolabelled 
cDNA by standard techniques. 

5 

II. Seven additional mutations at the methylene- 
tetrahydrof elate reductase (MTHPR) locus with 
genotype: phenotype correlations in severe MTHFR 
deficiency 

10 

It is reported hereinbelow the characterization 
of 7 additional mutations at this locus: 6 missense 
mutations and a 5* splice site defect which activates 
a cryptic splice site in the coding sequence. We also 

15 present a preliminary analysis of the relationship 
between genotype and phenotype for all 9 mutations 
identified thus far at this locus. A nonsense muta- 
tion and 2 missense mutations (proline to leucine and 
threonine to methionine) in the homozygous state are 

20 associated with extremely low activity (0-3%) and 
onset of symptoms within the first year. Other mis- 
sense mutations (arginine to cysteine and arginine to 
glutamine) are associated with higher enzyme activity 
and later onset of symptoms. 

25 7 additional mutations at the MTHFR locus are 

described and the association between genotype, enzyme 
activity, and clinical phenotype in severe MTHFR defi- 
ciency is examined- 

30 Patient description 

The clinical and laboratory findings of the 
patients have been reported in the published litera- 
ture . Residual MTHFR activity was previously measured 
in cultured fibroblasts at confluence. 
3 5 Patient 3 54, an African-American girl, was 

diagnosed at age 13 years with mild mental retarda- 
tion. Her sister, patient 3 55 was diagnosed at age 15 
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years with anorexia, tremor, hallucinations and pro- 
gressive withdrawal. In patient 354, residual MTHFR 
activity was 19% and in her sister, 355, it was 14% of 
control values. The residual activity after heating 
5 had equivalent thermal stability to control enzyme. 

Patient 1807, a Japanese girl whose parents are 
first cousins, had delayed walking and speech until 
age 2 years, seizures at age 6 years and a gait dis- 
turbance with peripheral neuropathy at age 16 years, 

10 Residual activity of MTHFR was 3% and the enzyme was 
thermolabile . 

Patient 73 5, an African-Indian girl, was diag- 
nosed at age 7 months with microcephaly, progressive 
deterioration of mental development, apnea and coma. 

15 Residual activity of MTHFR was 2% of control levels. 
Thermal properties were not determined. 

Patient 1084, a Caucasian male, was diagnosed 
at age 3 months with an infantile fibrosarcoma. He 
was found to be hypotonic and became apneic. He died 

20 at the age of 4 months. Residual activity of MTHFR 
was not detectable. Thermal properties were not 
determined. 

Patient 3 56, the first patient reported with 
MTHFR deficiency, is an Italian-American male who pre- 
25 sented at age 16 years with muscle weakness, abnormal 
gait and flinging movements of the upper extremities. 
MTHFR residual activity was 20% of control values; 
activity was rapidly and exponentially inactivated at 
55° . 

30 Patient 458, a Caucasian male, was diagnosed at 

age 12 years with ataxia and marginal school perform- 
ance- Residual MTHFR activity was approximately 10%, 
and the activity was thermolabile. 

Patient 1396, a Caucasian female, was described 

3 5 as clumsy and as having a global learning disorder in 
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childhood. At age 14 years, she developed ataxia, 
foot drop, and inability to walk. She developed deep 
vein thrombosis and bilateral pulmonary emboli. Resid- 
ual activity of MTHFR was 14% and the enzyme was ther- 
5 molabile. 

Genomic structure and intronic primers 

Exon nomenclature is based on available cDNA 
sequence in Goyette et al . {Nature Genetics , 1994, 

10 2*^^5-200). Exon 1 has been arbitrarily designated as 
the region of cDNA from bp 1 to the first intron. 
Identification of introns was performed by amplifica- 
tion of genomic DNA using cDNA primer sequences. PGR 
products that were greater in size than expected cDNA 

15 sizes were sequenced directly. 

Mutation detection 

Specific exons (see Table 1 for primer 
sequences ) were amplified by PGR from genomic DNA and 

20 analyzed by the SSCP protocol. SSGP was performed 
with the Phastgel™ system (Pharmacia), a non-isotopic 
rapid SSCP protocol, as previously described (Goyette 
P et al.. Nature Genetics, 1994, 2=195-200), or with 
^^S-labeled PGR products run on 6% acrylamide: 10% 

25 glycerol gels at room temperature (6 watts, over- 
night). In some cases, the use of restriction endonu- 
cleases, to cleave the PGR product before SSCP analy- 
sis, enhanced the detection of band shifts. PGR frag- 
ments with altered mobility were sequenced directly 

30 (GIBGO, Cycle Sequencing™ kit). If the sequence 
change affected a restriction endonuclease site, then 
the PGR product was digested with the appropriate 
enzyme and analyzed by PAGE. Otherwise, allele-spe- 
cific oligonucleotide (ASO) hybridization was per- 

3 5 formed on a dot blot of the PGR-amplif ied exon. 
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Tabl 1 

PCR Primers for DMA amplification and mutation 
analysis of MTHFR 



Exon 


Primer Type 


Primer Sequence (6'->3') 


Location 


Fragment 
Size (bp) 


1 


Sense 
Antisense 


AGCCTCAACCCCTGCTTGGAGG 
TGAGAGTTTGCTCCCCAGGCAC 


C 
1 


271 


4 


Sense 
Antisense 


TGAAGGAGAAGGTGTCTGCGGGA 
AGGACGGTGCGGTGAGAGTGG 


C 
1 


19S 


5 


Sense 
Antisense 


CACTGTGGTTGGCATGGATGATG 
GGCTGCTCTTGGACCCTCCTC 


1 
1 


392 


6 


Sense 
Antisense 


TGCTTCCGGCTCCCTCTAGCC 
CCTCCCGCTCCCAAGAACAAAG 


1 
! 


251 



5 

Table 2 

Summary of genotypes, enzyme activity, age at onset, 
and background of patients with MTHFR deficiency 



Patient ^ 


BPChanges^ 


Amino acid 
changes 


% 

Activity 


Age at Onset 


Background 


1807 


C764T/C764T 


Pro-* Leu/Pro-^ Leu 


3 


wrthin 1st year 


Japanese 


735 


C692T/C692T 


Thr->Met/Thr->M6t 


2 


7 months 


African Indian 


1084 


C692T/C692T 


Tlir-^Metn-hr-^Met 


0 


3 months 


Caucasian 


1554 


C559T/C559T 


Arg->Ter/Arg->Ter 


0 


1 month 


Native American 
(Hopi) 


1627 


C559T/C559T 


Arg^Ter/Arg->Ter 


1 


1 month 


Native American 
(Choctaw) 


356 


C985T/C985T 


Arg->^Cys/Arg->Cys 


20 


16yrs 


italian American 


458 


C1015T/G167A 


Arg->Cys/Arg->Gln 


10 


11 yrs 


Caucasian 


1396 


C1081T/G167A 


Arg^Cys/Arg-*GIn 


14 


14yrs 


Caucasian 


1779^ 


G482AA' 


Arg->Gln/? 


6 


15 yrs 


French Canadian 


1834^ 


G482A/? 


Arg->GIn/? 


7 


Asymptomatic 
at 37 yrs 


French Canadian 


1863 


G482A/? 


Arg-^GIn/? 


14 


21 yrs 


Caucasian 


354^ 


792 + 1G-^A/? 


5* splice site/? 


19 


13yrs 


African American 


355^ 


792 + 1G-»A/? 


5' splice site/? 


14 


11 yrs 


African American 



10 ^ Patients 1554, 1627, 1779, 1834 and 1 863 were previously reported by Goyette et al. (1994). 
^ ? = unidentified mutation. 
^ Patients 1779 and 1834 are sibs. 
^ Patients 354 and 355 are sibs. 
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( 1 ) 5' splice site mutation 

Amplification of cDNA, bp 653-939, from 
reverse-transcribed total fibroblast RNA revealed 2 
5 bands in sisters 354 and 355: a smaller PGR fragment 
(230 bp) in addition to the normal 287 bp allele 
(Fig. 8A), Fig. 8A is the PAGE analysis of amplifica- 
tion products of cDNA bp 653-93 9, from reverse tran- 
scribed RNA. Controls have the expected 287 bp frag- 

10 ment while patients 354 and 355 have an additional 230 
bp fragment- Sequencing of the smaller fragment iden- 
tified a 57 bp in-frame deletion which would remove 19 
amino acids (Fig. 8B). Fig. 8B is the direct sequenc- 
ing of the PGR products from patient 354. The 57 bp 

15 deletion spans bp 736-792 of the cDNA. An almost per- 
fect 5* splice site (boxed) is seen at the 5' deletion 
breakpoint. Analysis of the sequence at the 5' dele- 
tion breakpoint in the undeleted fragment revealed an 
almost perfect 5' splice site consensus sequence 

20 (AG/gcatgc). This observation suggested the presence 
of a splicing mutation in the natural 5* splice site 
that might activate this cryptic site, to generate the 
deleted allele- The sequence following the deletion 
breakpoint, in the mutant allele, corresponded exactly 

25 to the sequence of the next exon. Amplification of 
genomic DNA, using the same amplification primers as 
those used for reverse-transcribed RNA, generated a 
1.2 kb PGR product indicating the presence of an 
intron. Direct sequencing of this PGR fragment in 

3 0 patient 3 54 identified a heterozygous G-^A substitution 
in the conserved GT dinucleotide of the intron at the 
5* splice site (Fig. 8G), Fig, 8G is the sequencing 
of the 5' splice site in control and patient 354. The 
patient carries a heterozygous G->A substitution in the 

35 5' splice site (boxed). Intronic sequences are in 
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lower case. This substitution abolished a HphI 

restriction endonuclease site which was used to con- 
firm the mutation in the 2 sisters (Fig. 8D). Fig. 8D 
is the HphI restriction endonuclease analysis on PGR 
5 products of DNA for exon 4 of patients 354 and 355, 
and of 3 controls (C). The 198 bp PGR product has 2 
HphI sites. The products of digestion for the control 
allele are 151, 24 and 23 bp. The products of diges- 
tion for the mutant allele are 175 and 23 bp due to 
10 the loss of a HphI site. The fragments of 24 and 23 bp 
have been run off the gel. 

( 2 ) Patients with homozygous coding substitutions 

SSCP analysis of exon 4 for patient 1807 

15 revealed an abnormally-migrating fragment, which was 
directly sequenced to reveal a homozygous C— >T substi- 
tution (bp 764) converting a proline to a leucine 
residue- This change creates a Mnll restriction endo- 
nuclease site, which was used to confirm the homozy- 

20 gous state of the mutation (Fig. 9A) . Fig. 9A is the 
Mnil restriction analysis of exon 4 PGR products for 
patient 1807 and 3 controls (G). Expected fragments: 
control allele, 90, 46, 44, 18 bp; mutant allele, 73, 
46, 44, 18, 17 bp. An additional band at the bottom 

25 of the gel is the primer. Fifty independent control 
Gaucasian chromosomes and 12 control Japanese chromo- 
somes were tested by restriction analysis; all were 
negative for this mutation. Homozygosity in this pa- 
tient is probably due to the consanguinity of the 

30 parents. 

Patients 73 5 and 1084 had the same mutation in 
exon 4, in a homozygous state: a C->T substitution (bp 
692) which converted an evolutionarily-conserved 
threonine residue to a methionine residue, and abol- 
35 ished a NIalll restriction endonuclease site. Allele- 
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specific oligonucleotide hybridization to amplified 
exon 4 (Figs. lOA and lOB) was used to confirm the 
mutation in these 2 patients and to screen 60 inde- 
pendent chromosomes, all of which turned out to be 
5 negative. Fig. lOA is the hybridization of mutant 
oligonucleotide {692T) to exon 4 PGR products from 
patients 735, 1084 and 30 controls. Only DNA from 
patients 735 and 1084 hybridized to this probe. 
Fig, lOB is the hybridization of normal oligonucleo- 

10 tide (692C) to stripped dot blot from A. All control 
DNAs hybridized to this probe. 

Patient 356 showed a shift on SSCP analysis of 
exon 5. Direct sequencing revealed a homozygous C— >T 
substitution (bp 985) which converted an evolutionar- 

15 ily-conserved arginine residue to cysteine; the sub- 
stitution abolished an Acil restriction endonuclease 
site. This was used to confirm the homozygous state 
of the mutation in patient 3 56 (Fig. 9B) and its pres- 
ence in the heterozygous state in both parents- Fifty 

20 independent control chromosomes, tested in the same 
manner, were negative for this mutation. Fig. 9B is 
the Acil restriction analysis of exon 5 PGR products 
for patient 356, his father (F), his mother (M), and 3 
controls (G) . Expected fragments: control allele, 

25 129, 105, 90, 68 bp; mutant allele, 195, 129, 68 bp, 

( 3 ) Patients who are genetic compounds 

Patient 458 is a compound heterozygote of a 
mutation in exon 5 and a mutation in exon 1. The exon 

30 5 substitution (C-^T at bp 1015) resulted in the sub- 
stitution of a cysteine residue for an arginine resi- 
due; this abolished a Hhal restriction endonuclease 
site, which was used to confirm the mutation in 
patient 4 58 (Fig. 9C) and to show that 5 0 control 

35 chromosomes were negative- Fig, 9C is the Hhal 
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restriction analysis of exon 5 PCR products for 
patient 458 and 4 controls (C) . Expected fragments: 
control allele, 317 and 75 bp? mutant allele 392 bp. 
The 7 5 bp fragment is not shown in Fig. 9C. The 
5 second mutation was a heterozygous G— >A substitution 
(bp 167) converting an arginine to a glutamine resi- 
due. Allele-specif ic oligonucleotide hybridization to 
amplified exon 1 confirmed the heterozygous state of 
this mutation in patient 4 58 and identified a second 

10 patient (1396) carrying this mutation also in the 
heterozygous state (Figs. IOC and lOD) . Fig. IOC is 
the hybridization of mutant oligonucleotide (167A) to 
exon 1 PCR products from patients 458, 1396 and 31 
controls. Fig. lOD is the hybridization of normal 

15 oligonucleotide (167G) to stripped dot blot from C- 
None of the 62 control chromosomes carried this muta- 
tion . 

The second mutation in patient 1396 was identi- 
fied in exon 6: a heterozygous C— >T substitution (bp 

20 1081) that converted an arginine residue to a cysteine 
residue, and abolished a Hhal restriction endonuclease 
site. Restriction analysis confirmed the heterozygous 
substitution in 1396 (Fig. 9D) and showed that 50 con- 
trol chromosomes were negative- Fig. 9D is the Hhal 

25 restriction analysis of exon 6 PCR products for 
patient 1396 and 2 controls (C). Expected fragments: 
control allele, 152, 86, 13 bp; mutant allele 165, 86 
bp. The 13 bp fragment has been run off the gel. 

30 (4) Additional sequence changes 

Hhal analysis of exon 6, mentioned above, 
revealed a DNA polymorphism. This change is a T-»C 
substitution at bp 1068 which does not alter the amino 
acid (serine), but creates a Hhal recognition site. T 

35 at bp 1068 was found in 9% of tested chromosomes. 
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Sequence analysis identified 2 discrepancies with the 
published cDNA sequence: a G— >A substitution at bp 542 
which converts the glycine to an aspartate codon, and 
a C-^T change at bp 103 2 which does not alter the amino 
5 acid (threonine). Since all DNAs tested (>50 chromo- 
somes) carried the A at bp 542 and the T at bp 1032, 
it is likely that the sequence of the original cDNA 
contained some cloning artifacts. 

10 Genotype rphenotype correlation 

Table 2 summarizes the current status of muta- 
tions in severe MTHFR deficiency. In 8 patients, both 
mutations have been identified; in 5 patients (3 fami- 
lies), only 1 mutation has been identified. Overall 

15 the correlation between the genotype, enzyme activity, 
and phenotype is quite consistent. Five patients, 
with onset of symptoms within the first year of life, 
had < 3% of control activity- Three of these patients 
had missense mutations in the homozygous state: two 

20 patients with the threonine to methionine substitution 
(C692T) and one patient with the proline to leucine 
substitution (C764T). The nonsense mutation (C559T) 
in the homozygous state in patients 1554 and 1627 
(previously reported in Goyette P et al.. Nature 

25 Genetics, 1994, 2=195-200) is also associated with a 
neonatal severe form, as expected. 

The other patients in Table 2 had >. 6% of con- 
trol activity and onset of symptoms within or after 
the 2nd decade of life; the only exception is patient 

30 1834, as previously reported (Goyette P et al.. Nature 
Genetics, 1994, 7:195-200). The three patients (356, 
458 and 1396) with missense mutations (G167A, C985T, 
C1015T and C1081T) are similar to those previously 
reported (patients 1779, 1834 and 1863) who had an 

3 5 arginine to glutamine substitution and a second 
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unidentified mutation (Goyette P et al . , Nature Genet- 
ics, 1994, 2:195"200). The sisters with the 5* splice 
mutation and an unidentified second mutation also had 
levels of activity in the same range and onset of 
5 symptoms in the second decade, but the activity is 
likely due to the second unidentified allele. 

Discussion 

The patients come from diverse ethnic back- 

10 grounds- Although patients 1554 and 1627 are both 
Native Americans, the mutations occur on different 
haplotypes, suggesting recurrent mutation rather than 
identity by descent. Since the substitution occurs in 
a CpG dinucleotide, a "hot spot" for mutation, recur- 

15 rent mutation is a reasonable hypothesis- It is dif- 
ficult to assess whether some mutations are popula- 
tion-specific since the numbers are too small at the 
present time. 

MTHFR deficiency is the most common inborn 

20 error of folate metabolism, and a major cause of 
hereditary homocysteinemia . The recent isolation of a 
cDNA for MTHFR has permitted mutational analysis at 
this locus, with the aims of defining important 
domains for the enzyme and of correlating genotype 

25 with phenotype in MTHFR-def icient patients. 

Our definition of a disease-causing substitu- 
tion, as distinct from a benign polymorphism, is based 
on 3 factors: (1) absence of the change in at least 50 
independent control chromosomes; (2) presence of the 

30 amino acid in the bacterial enzyme, attesting to its 
evolutionary significance and (3) whether the change 
in amino acid is conservative- Although expression of 
the substitutions is required to formally prove that 
they are not benign, the criteria above allow us to 
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postulate that the changes described in this report 
are likely to affect activity. 

The 7 mutations described here (6 single amino 
acid substitutions and a 5' splice site mutation) 
5 bring the total to 9 mutations identified thus far in 
severe MTHFR deficiency and complete the mutation 
analysis for 8 patients. The identification of each 
mutation in only one or two families points to the 
striking degree of genetic heterogeneity at this 
10 locus. Seven of the 9 mutations are located in CpG 
dinucleotides , which are prone to mutational events. 

5 ' splice site mutation 

The G->A substitution at the GT dinucleotide of 

15 the 5* splice site in patients 354 and 355 results in 
a 57bp in-frame deletion of the coding sequence, which 
should delete 19 amino acids of the protein. This 
deletion occurs as a result of the activation of a 
cryptic 5* splice site (AG/gc) even though this cryp- 

20 tic site does not have a perfect 5* splice site con- 
sensus sequence (AG/gt). However, GC (instead of GT ) 
as the first 2 nucleotides of an intron have been 
reported in several naturally-occurring splice sites, 
such as in the genes for human prothrombin and human 

25 adenine phosphoribosyltransf erase and twice within the 
gene for the largest subunit of mouse RNA polymerase 
II. The remaining nucleotides of the cryptic site 
conform to a normal splice site consensus sequence 
with its expected variations ^-^60 

30 G79/gi00tl00^59^7l982t50 ) • is unlikely that the 

deleted enzyme resulting from this aberrantly-spliced 
mRNA would have any activity; 8 of the 19 deleted 
amino acids are conserved in the E. coli enzyme. 
Although the 2 patients show residual enzyme activity 

35 in the range of 20% of controls, the activity is prob- 
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ably due to the unidentified second allele in these 
patients . 

6 missense mutations 

5 The Arg->Cys substitution (C1081T) in patient 

1396 is within a hydrophilic sequence previously pos- 
tulated to be the linker region between the catalytic 
and regulatory domains of MTHFR (Goyette P et al.. 
Nature Genetics, 1994, 2=195-200), These 2 domains 

10 are readily separable by mild trypsinization of the 
porcine enzyme. The linker domain, a highly-charged 
region, is likely to be located on the outside surface 
of the protein and therefore more accessible to prote- 
olysis- Because the Arg— >Cys substitution converts a 

15 charged hydrophilic residue to an uncharged polar 
residue, it cannot be considered a conservative 
change, and could affect the stability of the enzyme. 

The 2 Arg^Cys substitutions identified in 
patients 356 and 458 (C985T and C1015T, respectively) 

20 may be involved in binding the FAD cof actor. Previ- 
ous work in the literature showed that heating fibro- 
blast extracts at 5 5*", in the absence of the FAD 
cof actor, inactivated MTHFR completely. The addition 
of FAD to the reaction mixture before heat inactiva- 

2 5 tion restored some enzyme activity to control extracts 

and to extracts from some patients, while the extracts 
of patients 356 and 458 were unaffected. Based on 
these observations, it was suggested that these 2 
patients had mutations affecting a region of the pro- 

3 0 tein involved in binding FAD, The 2 mutations are 

found in close proximity to each other, within 11 
amino acids. In patient 356, the Arg residue is evo- 
lutionarily-conserved in E. coli and is found in a 
stretch of 9 conserved amino acids, suggesting a 
35 critical role for this residue; the altered Arg resi- 
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due in patient 458 is not evolutionarily-conserved. 
Crystal structure analysis of medium chain acyl-CoA 
dehydrogenase (MCAD), a f lavoprotein, has defined 
critical residues involved in the binding of FAD. Two 
5 consecutive residues of the MCAD protein, Metl65 and 
Trpl66, involved in interactions with FAD, can also be 
identified in MTHFR, 3 and 4 amino acids downstream, 
respectively, from the Arg residue altered in patient 
458, 

10 The Thr->Met substitution (C692T), is found in 

a region of high conservation with the E. coll enzyme 
and in a region of good homology with human dihydro- 
folate reductase (DHFR) (Fig. 11). in Fig. 11, = is 
identity; • is homology; and O is identity to bovine 

15 DHFR enzyme. An asterisk (*) indicates location of 
Thr-»Met substitution. Considering the early-onset 
phenotype of the patients, one can assume that the 
threonine residue is critical for activity or that it 
contributes to an important domain of the protein . 

20 This region of homology in DHFR contains a residue, 
Thrl3 6, which has been reported to be involved in 
folate binding. This Thr residue in DHFR aligns with 
a Ser residue in MTHFR, an amino acid with similar 
biochemical properties. The Thr — > Met substitution is 

25 located 8 amino acids downstream from this Ser codon, 
in the center of the region of homology between the 2 
enzymes. We therefore hypothesize that the Thr -> Met 
substitution may alter the binding of the folate sub- 
strate , 

30 The G167A (Arg -> Gin) and C764T (Pro -> Leu) 

substitutions both affect non-conserved amino acids. 
Their importance in the development of MTHFR defi- 
ciency cannot be determined at the present time. All 
the mutations identified thus far are located in the 

35 5' end of the coding sequence, the region thought to 



BNSDOCIO: <WO 9533054A1 _!_> 



wo 95/33054 



PCT/CA95/00314 



encode the catalytic domain of MTHFR. Mutation analy- 
sis has been useful in beginning to address the struc- 
ture: function properties of the enzyme as well as to 
understand the diverse phenotypes in severe MTHFR 
5 deficiency. 

III. Identification of A->V mutation 

SSCP analysis and direct sequencing of PGR 
fragments were used to identify a C to T substitution 

10 at bp 677, which converts an alanine residue to a 
valine residue (Fig- 5A) . The primers for analysis of 
the A-->V change are; 5 ' -TGAAGGAGAA GGTGTCTGCG GGA-3 ' 
( exonic ) and 5 ' -AGGACGGTGC GGTGAGAGTG-3 ' ( intronic ) ; 
these primers generate a fragment of 198 bp. Fig. 5A 

15 depicts the sequence of two individuals, a homozygote 
for the alanine residue and a homozygote for the 
valine residue. The antisense strands are depicted. 
This alteration creates a Hinfl site (Fig. 5B), which 
was used to screen 114 unselected French Canadian 

20 chromosomes; the allele frequency of the substitution 
was .38. The substitution creates a Hinfl recognition 
sequence which digests the 19 8 bp fragment into a 175 
bp and a 23 bp fragment; the latter fragment has been 
run off the gel. Fig. 5B depicts the three possible 

25 genotypes. The frequency of the 3 genotypes were as 
follows: -/-, 37%; +/-, 51%; and +/+, 12% (the ( + ) 
indicates the presence of the Hinfl restriction site 
and a valine residue). 

The alanine residue is conserved in porcine 

30 MTHFR, as well as in the corresponding metF and 
stymetF genes of E. coli and S, typhimurium, respec- 
tively. The strong degree of conservation of this 
residue, and its location in a region of high homology 
with the bacterial enzymes, alluded to its importance 

35 in enzyme structure or function. Furthermore, the 
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frequency of the (+/+) genotype was consistent with 
the frequency of the thermolabile MTHFR variant impli- 
cated in vascular disease - 

5 Clinical material 

To determine the frequency of the A->V muta- 
tion, DNA from 57 individuals from Quebec was analyzed 
by PGR and restriction digestion. The individuals, 
who were all French Canadian, were not examined clini- 

10 cally or biochemically - 

The 40 individuals analyzed in Table 3 had been 
previously described in Engbersen et al . {Am. J", Hum. 
Genet., 1995, 16:142-150). Of the 13 cardiovascular 
patients, 8 had cerebrovascular arteriosclerosis and 5 

15 had peripheral arteriosclerosis. Five had thermo- 
labile MTHFR while 8 had thermostable MTHFR (greater 
than 33% residual activity after heating). Controls 
and patients were all Dutch-Caucasian, between 20-60 
years of age. None of these individuals used vitamins 

20 which could alter homocysteine levels. Enzyme assays 
and homocysteine determinations were also reported by 
Engbersen et al. {Am. J. Hum. Genet., 1995, 56 : 142- 
150) . 
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Table 3 

Correlation betwe n MTHPR g notyp and nzyme activ 
ity, thermolability and plasma homocys-teine level 

5 





-/- 


+/- 


+/+ 




n=19 


n=9 


n=12 


specific activity^ib 


22.9±1.7 


15.0 ±0.8 


6.9 ± 0.6 


(nmol CH20/mg.protein/hr) 


(11.8-33.8) 


(10.2-18.8) 


(2.6-10.2) 


residual activity after 


66.8 ±1.5 


56.2 ± 2.8 


21 .8 ±2.8 


heating^'** (%) 


(55-76) 


(41-67) 


(10-35) 


plasma homocysteine ^»*^ 


12.6 ±1.1 


13.8 ± 1.0 


22.4 ± 2.9 


(pM)(after fasting) 


(7-21) 


(9.6-20) 


(9.6-42) 


plasma homocysteine 


41.3±5.0d 


41 ±2.8 


72.6 ± 11.7® 


(MM)(post-methionine load) 


(20.9 -110) 


(29.1-54) 


(24.4-159) 



3 one-way anova p<.01 

t> paired t test for alt combinations p<.01 

c paired t test p<.05 for +/+ group versus +/- group or -/- group; p>.05 for +/- versus 
-/- group. 
10 ^ n=1 8 for this parameter 
® n=11 for this parameter 

Enzyme activity and plasma homocysteine were 
determined as previously reported. Each value repre- 
sents mean ± standard error. The range is given in 
15 parentheses below the mean. 

Correlation of A-»V mutation with altered MTHPR func- 
tion 

A genotypic analysis was performed and enzyme 
20 activity and thermolability were measured in a total 
of 40 lymphocyte pellets from patients with premature 
vascular disease and controls. 13 vascular patients 
were selected from a previous study (Engbersen et al.. 
Am. J. Hum. Genet., 1995, 142-150 ) , among which 5 

25 were considered to have thermolabile MTHFR, From a 
large reference group of 89 controls, all 7 
individuals who had thermolabile MTHFR were studied, 
and an additional 2 0 controls with normal MTHFR were 
selected from the same reference group. Table 3 
30 documents the relationship between genotypes and spe- 
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cific enzyme activity, thermolability and plasma homo- 
cysteine level. The mean MTHFR activity for individu- 
als homozygous for the substitution (+/+) was approxi- 
mately 3 0% of the mean activity for (-/-) individuals, 
5 homozygous for the alanine residue. Heterozygotes had 
a mean MTHFR activity that was 65% of the activity of 
(-/-) individuals; this value is intermediate between 
the values for ( -/- ) and (+/+) individuals. The 
ranges of activities showed some overlap for the 

10 heterozygous and (-/-) genotypes, but homozygous (+/+) 
individuals showed virtually no overlap with the for- 
mer groups, A one-way analysis of variance yielded a 
p value <.0001; a pairwise Bonferroni t test showed 
that all three genotypes were significantly different 

15 with p<0.01 for the three possible combinations - 

The three genotypes were all significantly dif- 
ferent (p<.01) with respect to enzyme thermolability . 
The mean residual activity after heat inactivation for 
5 minutes at 46^ was 67%, 56% and 22% for the (-/-), 

20 (+/-) and (+/+) genotypes, respectively. While the 
degree of thermolability overlaps somewhat for (-/-) 
individuals and heterozygotes, individuals with two 
mutant alleles had a distinctly lower range. Every 
individual with the (+/+) genotype had residual activ- 

25 ity <35% after heating, and specific activity <50% of 
that of the (-/-) genotype. 

Total homocysteine concentrations, after fast- 
ing and 6 hours after methionine loading, were meas- 
ured in plasma by high performance liquid chromatogra- 

3 0 phy using fluorescence detection. Fasting homocysteine 
levels in (+/+) individuals were almost twice the 
value for (+/-) and (-/-) individuals. The differ- 
ences among genotypes for plasma homocysteine were 
maintained when homocysteine was measured following 6 

35 hours of methionine loading. A one-way anova yielded 
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a p < .01 for the fasting and post-methionine homocys- 
teine levels- A pairwise Bonferroni t test showed 
that only homozygous mutant individuals had signifi- 
cantly elevated homocysteine levels (p<.05). 

5 

PCR-based mutagenesis for expression of A— >V mutation 

in vitxo 

PCR-based mutagenesis, using the cDNA-contain- 
ing Bluescript™ vector as template, was used to create 

10 the A to V mutation. Vent™ polymerase (NEB) was used 
to reduce PGR errors. The following primers were 
used: primer 1, bp -200 to -178, sense; primer 2, bp 
667 to 687, antisense, containing a mismatch. A, at bp 
677; primer 3, 667 to 687, sense, containing a mis- 

15 match, T, at bp 677; primer 4, bp 1092 to 1114, 
antisense. PGR was performed using primers 1 and 2 to 
generate a product of 887 bp, and using primers 3 and 
4 to generate a product of 447 bp. The two PGR frag- 
ments were isolated from a 1,2% agarose gel by Gene- 

20 clean™ (BIO 101), A final PGR reaction, using primers 
1 and 4 and the first 2 PGR fragments as template, was 
performed to generate a 1,3 kb band containing the 
mutation. The 1.3 kb fragment was digested with Ncol 
and MscI, and inserted into the wild-type cDNA- con- 

25 taining expression vector by replacing the sequences 
between the Ncol site at bp 11 and the MscI site at bp 
943. The entire replacement fragment and the cloning 
sites were sequenced to verify that no additional 
changes were introduced by PGR. 

30 

Expression analysis of wild-type and mutagenized cDNA 

Overnight cultures of JMIOS™ containing vector 
alone, vector + wild-type MTHFR cDNA, or vector + 
mutagenized cDNA were grown at 37** G. in 2 x YT media 
3 5 with .05 mg/ml ampicillin. Fresh 10 ml. cultures of 
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each were inoculated with approximately 50 laL of over- 
night cultures for a starting 0-D. of 0.05, and were 
grown at 37 C to an O.D. of 1 at 420 nM. Cultures 
were then induced for 2 hrs . with 1 mM IPTG and pel- 
5 letted. The cells were resuspended in TE buffer with 
2 (jg/ml aprotinin and 2|ag/ml leupeptin (3.5 x wet 
weight of cells ) .Cell suspensions were sonicated on 
ice for 3 x 15 sec. to break open cell membranes and 
then centrifuged for 30 mins. at 4*'C. to pellet cell 

10 debris and unlysed cells. The supernatant was removed 
and assayed for protein concentration with the Bio-Rad 
™ protein assay. Western analysis was performed using 
the Amersham ECL™ kit according to the instructions 
of the supplier, using antiserum generated against 

15 purified porcine liver MTHFR. Enzymatic assays were 
performed by established procedures; thermolability 
was assessed by pre-treating the extracts at 46**C. for 
5 mins. before determining activity. Specific activi- 
ties ( nmol f ormaldehyde/hr . /mg . protein ) were calcu- 

20 lated for the 2 cDNA-containing constructs after sub- 
traction of the values obtained with vector alone (to 
subtract background E, coll MTHFR activity) - 

The MTHFR cDNA (2.2 kb ) ( Fig . 6 ) has an open 
reading frame of 1980 bp, predicting a protein of 74.6 

25 kDa. The purified porcine liver enzyme has been shown 
to have subunits of 77 kDa. Western analysis 

(Fig. 7A) of several human tissues and of porcine 
liver has revealed a polypeptide of 77 kDa in all the 
studied tissues, as well as an additional polypeptide 

30 of approximately 7 0 kDa in human fetal liver and in 
porcine liver, suggesting the presence of isozymes. 
Two [xg of bacterial extract protein was used for lanes 
1-3. The tissues (lanes 4-8) were prepared by 
homogenization in -25M sucrose with protease 

35 inhibitors (2 [iq/ml each of aprotinin and leupeptin), 
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followed by sonication (3 x 15 sec.) on ice. The 
extracts were spun for 15 min. in a microcentrifuge at 
14, 000 g and 100 jag of supernatant protein was used 
for Western analysis. h=human; p=porcine. 
5 The wild-type cDNA and a mutagenized cDNA, con- 

taining the A-->V substitution, were expressed in E. 
coli to yield a protein of approximately 70 kDa 
(Fig. 7A), which co-migrates with the smaller polypep- 
tide mentioned above- Treatment of extracts at 4 6 °C 

10 for 5 minutes revealed that the enzyme containing the 
substitution was significantly more thermolabile than 
the wild-type enzyme {p<.001; Fig. 7B). Two separate 
experiments (with 3-4 replicates for each construct 
for each experiment) were performed to measure thermo- 

15 stable activity of the wild-type MTHFR and mutagenized 
MTHFR A— >V cDNAs- The values shown represent mean ± 
standard error for each experiment, as % of residual 
activity after heating. The means of the specific 
activities before heating (expressed as nmol formalde- 

20 hyde/hr./mg. protein) were as follows: Exp. 1, 3.8 and 
5.3 for MTHFR and MTHFR A->V, respectively; Exp. 2, 6.2 
and 7.5 for MTHFR and MTHFR A-^V, respectively. The 
expression experiments were not designed to measure 
differences in specific activity before heating, since 

25 variation in efficiencies of expression could contrib- 
ute to difficulties in interpretation. Curiously 
though, the specific activity for the mutant construct 
was higher in both experiments. It is possible that 
the mutant protein has increased stability in E. coli, 

30 or that inclusion bodies in our extracts contributed 
to differences in recovery of properly-assembled 
enzyme . 

These studies have identified a common substi- 
tution in the MTHFR gene which results in thermolabil- 
3 5 ity in vitro and in vivo. The mutation, in the 
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heterozygous or homozygous state, correlates with 
reduced enzyme activity and increased thermolability 
of MTHFR in lymphocyte extracts. A significant eleva- 
tion in plasma homocysteine was observed in individu- 
5 als who were homozygous for the mutation. Statisti- 
cally-significant differences for homocysteine levels 
were not observed between heterozygotes and (-/-) 
individuals; this observation is not surprising, since 
plasma homocysteine can be influenced by several envi- 

10 ronmental factors, including intake of folate, vitamin 
^12' vitamin B5, and methionine, as well as by genetic 
variation at other loci, such as the cystathionine-B- 
synthase gene. 

The alanine to valine substitution conserves 

15 the hydrophobicity of the residue and is associated 
with small changes in activity, in contrast to non- 
conservative changes, such as the previously-reported 
arginine to glutamine change in MTHFR, which is asso- 
ciated with a greater decrease in enzyme activity and 

20 severe hyperhomocysteinemia . The alanine residue is 
situated in a region of homology with the bacterial 
metP genes. We have also observed the same region of 
homology in the human dihydrof olate reductase (DHFR) 
gene (Fig. 11), although the alanine residue itself is 

25 not conserved; this region of amino acids 130-149 of 
DHFR contains T136 which has been implicated in folate 
binding in an analysis of the crystal structure of 
recombinant human DHFR. It is tempting to speculate 
that this region in MTHFR is also involved in folate 

3Q binding and that the enzyme may be stabilized in the 
presence of folate. This hypothesis is compatible 
with the well-documented influence of folate on homo- 
cysteine levels and with the reported correction of 
mild hyperhomocysteinemia by folic acid in individuals 
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with premature vascular disease, and in individuals 
with thermolabile MTHFR. 

Although our cDNA is not long enough to encode 
the larger MTHFR polypeptide, it is capable of direct- 
5 ing synthesis of the smaller isozyme. The ATG start 
codon for this polypeptide is within a good consensus 
sequence for translation initiation- Whether the 
isozyme is restricted to liver and what its role is in 
this tissue remain to be determined. 

10 These data have identified a common genetic 

change in MTHFR which results in thermolability ; our 
experiments do not directly address the relationship 
between this change and vascular disease. Nonethe- 
less, this polymorphism represents a diagnostic test 

15 for evaluation of MTHFR thermolability in hyperhomo- 
cysteinemia. Large case-control studies are required 
to evaluate the frequency of this genetic change in 
various forms of occlusive arterial disease and to 
examine the interaction between this genetic marker 

20 and dietary factors. Well-defined populations need to 
be examined, since the limited data set thus far sug- 
gests that population-specific allele frequencies may 
exist. More importantly, however, the identification 
of a candidate genetic risk factor for vascular dis- 

25 ease, which may be influenced by nutrient intake, rep- 
resents a critical step in the design of appropriate 
therapies for the homocysteinemic form of arterioscle- 
rosis. 

30 cDNA FOR MTHFR AND ITS POTENTIAL UTILITY 

A human cDNA for MTHFR (2.2 kb) has been iso- 
lated, as reported by us in Goyette et al. {Nature 
Genetics, 1994, 2:195-200) and Frosst et al. {Nature 
Genetics, 1995, 10:111-113). The cDNA has been 
3 5 expressed in v^itro to yield a MTHFR protein of 
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approximately 70 kDa (Frosst P et al,. Nature Genet- 
ics, 1995, 10;111-113) . 

Using the cDNA sequence, mutations in patients 
with severe and mild MTHFR deficiency (Goyette P et 
5 al.. Nature Genetics, 1994, 2:195-200; Goyette P et 
al.. Am. J. Hum. Genet., 1995, 56:1052-1059; Frosst P 
et al.. Nature Genetics, 1995, 111-113) were iden- 

tified. 

The cDNA sequence is a necessary starting point 
10 for the detection of MTHFR sequence changes that would 
identify individuals at risk for cardiovascular and 
neurological diseases, as well as other disorders 
affected by folic acid metabolism. Diagnostic tests by 
DNA analysis are more efficient and accurate than 
15 testing by enzymatic/biochemical assays. Less blood is 
required and results are available in a shorter period 
of time. The tests could be performed as a routine 
operation in any laboratory that performs molecular 
genetic diagnosis, without the specialized 

20 reagents/expertise that is required for an enzyme- 
based test. 

The second major utility of the cDNA would be 
in the design of therapeutic protocols, for correction 
of MTHFF deficiency. These protocols could directly 

25 involve the gene, as in gene therapy trials or in the 
use of reagents that could modify gene expression. 
Alternatively, the therapy might require knowledge of 
the amino acid sequence (derived from the cDNA 
sequence), as in the use of reagents that would modify 

30 enzyme activity. The identification of sequences 
and/or sequence changes in specific regions of the 
cDNA or protein, such as FAD binding sites or folate- 
binding sites, are useful in designing therapeutic 
protocols involving the above nutrients. 



35 
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UTILITY OF INVENTION IN CLINICAL AND DIAGNOSTIC 
STUDIES 

Coronary artery disease patients in Montreal 
(n=153) were studied to examine the frequency of the 
5 alanine to valine substitution. Fourteen percent of 
the patients were homozygous for this mutation- An 
analysis of 70 control individuals (free of cardiovas- 
cular disease) demonstrated that only seven % of these 
individuals were homozygous for the alanine to valine 
10 mutation. 

Analysis of homocysteine levels in 123 men of 
the above patient group indicated that the mutant 
allele significantly raised homocysteine levels from 
10.2 micromoles/L in homozygous normal men to 11.5 and 
15 12.7 in heterozygotes and homozygous mutants, respec- 
tively . 

Families with a child with spina bifida, a 
neural tube defect, have been examined for the 
presence of the alanine to valine mutation. 

20 Approximately 16% of mothers who had a child with 
spina bifida were homozygous for this mutation, while 
only 5% of control individuals were homozygous. 
Fathers of children with spina bifida also had an 
increased prevalence of the homozygous mutant genotype 

25 (10%) as did the affected children themselves (13%). 

Table 4 indicates the interactive effect of 
folic acid with the homozygous mutant alanine to 
valine change. In a study of families from Framingham, 
Massachusetts and Utah, individuals who were 

30 homozygous mutant but had folate levels above 5 ng/ml 
did not have increased homocysteine levels compared to 
individuals with the normal or heterozygous genotype. 
However, individuals who were homozygous mutant but 
had folate levels below 5 ng/ml had homocysteine 

35 levels that were significantly higher than the other 
genotypes . 
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Table 4 

Mean fasting and PML homocysteine levels for different 

MTHFR genotypes 



MTHFR genotype 



Plasma Homocysteine 


Nomnals (-/-) 


Heterozygotes 
(+/.) 


Homozygotes 


^trend 


N 


58 


61 


30 




Fasting* 


9.4 


9.2 


12.1 


0.02 


Folate <5 ng/mL 


10.2 


10.4 


15.2 


0.002 


Folate ^ 5ng/mL 


8.2 


7.5 


7.5 


0.52 


Post-Methionine toad 


30.0 


30.9 


31.3 


0.62 



5 * Significant interaction between folate levels and genotype (p=0,03) 

Example III provides preliminary data for 
therapeutic intervention by folic acid supplementation 
to individuals who are homozygous for the alanine to 

10 valine change- The data suggest that higher levels of 
plasma folate would lead to normalization of homocys- 
teine levels in mutant individuals and might prevent 
the occurrence of disorders associated with high homo- 
cysteine levels, such as cardiovascular disease, neu- 

15 ral tube defects, and possibly other disorders- Folic 
acid supplementation for mutant individuals might also 
restore methionine and S-adenosylmethionine levels to 
normal. This would be relevant for disorders that are 
influenced by methylation, such as neoplasias, devel- 

20 opmental anomalies, neurologic disease, etc. 

While the invention has been described in con- 
nection with specific embodiments thereof, it will be 
understood that it is capable of further modifications 
and this application is intended to cover any varia- 

25 tions, uses, or adaptations of the invention follow- 
ing, in general, the principles of the invention and 
including such departures from the present disclosure 
as come within known or customary practice within the 
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art to which the invention pertains and as may be 
applied to the essential features hereinbefore set 
forth, and as follows in the scope of the appended 
claims . 
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WE CLAIM ; 

1 . A cDNA probe for human methylenetetrahydro- 
folate reductase (MTHFR) . 

2. Use of the probe of claim 1 for the iden- 
tification of sequence abnormalities in patients with 
severe or mild MTHFR deficiency. 

3. The use of claim 2, wherein said deficiency is 
selected from the group consisting of cardiovascular 
and neurological disorders and disorders influenced by 
folic acid metabolism. 

4. Use of the probe of claim 1 for gene therapy to 
produce the MTHFR protein. 

5. A human MTHFR protein which is encoded by the 
probe of claim 1. 

6. Use of the probe of claim 1 and the protein of 
claim 4 for therapy of MTHFR-def iciency patients by 
dietary, biochemical or pharmacological approaches. 
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1 0 2 0 3 0 U 50 

It XX XXX X XXX 

AAT TC[ GGA 6CC ATG GT6 AAC OAA GCC AGA GGA AAC AGC A6C (TC AAC CCC 
TIA AGG CCT CGG TAC [AC TTG CTT CGG TCI CCT ITG TCG ICG GAG TTG GGG 
Asn Ser Gly Ala Met Val Asn Glu Ala Arg Gly Asn Ser Ser leu Asn Pro> 



60 70 80 90 100 

X X X XX X XX X X 

TGC TIG GAG GGC AGT GCC AGC AGT GGC AGT GAG AGC TCC AAA GAT AGT TCG 

ACG AAC CTC CCG TCA CGG TCG TCA CCG TCA CTC TCG AGG TTT CTA TCA AGC 

Cys leu Glu Gly Ser Ala Ser Ser Gly Ser Glu Ser Ser lys Asp Ser Ser» 



110 170 no no iso 

X X XX X XX X XX 

AGA TGT TCC ACC CCG GGC CTG GAC CCT GAG CGG CAT GAG AGA CTC CGG GAG 

TCT ACA AGG TGG GGC CCG GAC CTG GGA CTC GCC GIA CTC TCT GAG GCC CTC 

Arg Cys Ser Thr Pro Gly leu Asp Pro Glu Arg His Glu Arg leu Arg Giu> 



160 no 1 8 0 1 90 20 0 

X XX X XX X XX X 

AAG ATG AGG CGG CQA TTG GAA TCT GGT GAC AAG TGG TTC TCC CTG GAA TTC 

TTC TAC TCC GCC GCT AAC CTT AGA CCA CTG TTC ACC AAG AGG GAC CTT AAG 

lys Hel Arg Arg Arg leu Glu Ser Gly Asp lys Trp PIte Ser leo Glu Phe> 



210 220 230 

XX X XX X 

TTC CCT CCT C6A ACT GCT GAG GGA GCT 

AAG GGA GGA GCT TGA CGA CTC CCT CGA 

Phe Pro Pro Arg Thr Ala Glu Gly Ala 



2U 250 

XX X XX 

GT[ AAT CTC ATC TCA AGG TTT GAC 

CAG TTA GAG TAG AGT TCC AAA CTG 

Val Asn leu lie Ser Arg Phe Asp» 



260 270 2 

X X X X 

COG ATG GCA GCA GGT GGC CCC CTC 

GCC TAC CGT [GT CCA CCG GGG GAG 

Arg He! Ala Ala Gly Gly Pro leu 

3 1 0 320 3 30 

XX X XX 

GGT GAC CCT GGC TCA GAC AAG GAG 

CCA CTG GGA CCG AGT CTG TTC CU 

Oly Asp Pro Gly Ser Asp lys Glu 



10 290 300 

XX X XX X 

TAC ATA GAC GTG ACC TGG CAC CCA GCA 

ATG TAT CTG CAC TGG ACC GTG GGT CCT 

Tyr lie Asp Val Thr Trp His Pro Ala. 



3U 350 

X X X X X 

ACC TCC TCC ATG ATG ATC GCC AGC ACC 

TGG AGG AGG TAC TAC TAG CGG TCG TGG 

Thr Ser Ser Met Hel He Ala Ser Thr. 
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3 6 0 3 7 0 3 8 0 3 9 0 UO 

X X XX X XX X XX 

GCC GIG AA( TAC TGT 6GC CTG GAG ACC ATC CIG CAC ATG ACC TGC TGt (GT 

CGG CAC ITG AIG ACA CCG CAC CU TGG lAG 6AC GIG TAC TGG ACG AC6 GCA 

Ala Val Asn Tyr Cys Gly leu Glu Thr lie leu His Net Thr Cys (ys Arg> 



410 UO UO UO 4S0 

X XX X XX X XX X 

CAG CGC CTG GAG GAG ATC ACG GGC CAT CTG CAC AAA GCT AAG CAG CTG GGC 

GTC GCG GAC CTC CTC TAG TGC CCG GTA GAC GIG TTT CGA TTC GTC GAC CCG 

Gin Arg leu Glu Glu lie Thr Gly His leu His lys Ala Lys Gin leu Giy> 



UO UO 4B0 490 500 S10 

CTG AAG AAC ATC ATG GCG CTG CGG GGA GAC CCA ATA GGT GAC CAG TGG GAA 

GAC TTC TTG TAC TAC CGC GAC GCC CCT CTG GGT TAT CCA CTG GTC ACC CTT 

leu Lys Asn He Met Ala leu Arg Gly Asp Pro He Gly Asp Gin Trp Glu> 



S?0 S30 540 550 560 

tlx X X X X X t X 

GAG GAG GAG GGA GGC TTC AAC TAC GCA GTG GAC CTG GTG AAG CAC ATC CGA 

CTC CTC CTC CCT CCG AAG TTG ATG CGT CAC CTG GAC CAC TTC GTG TAG GCT 

Glu Glu Glu Gly Gly Ptie Asn Tyr Ala Val Asp leu Val lys His lie Arg> 



510 580 590 600 610 

>> I tl X XX I X 

ACT GAG TTT CGI GAC TAC TTT GAC ATC TGI GIG GCA GOT TAC CCC AAA GGC 

TCA CTC AAA CCA CTC AIG AAA CTG TAG ACA CAC CGT CCA AIG GGC TTT CCG 

Ser Glu Ptie Gly Asp lyr Ptie Asp He Cys Val Ala Gly Tyr Pro lys Gly» 



620 630 640 650 660 

X X XX X XX X XX 

CAC CCC GAA GCA GGC ACC ITT GAG GCT GAC CTG AAC CAC TIG AAG GAG AAG 

CTG CCG CTT CGI CCC ICG AAA CTC CGA CIG GAC ITC GTG AAC TTC CTC TTC 

His Pro Glu Ala Gly Ser Phe Glu Ala Asp leu Lys His leu lys Glu Lys> 



670 680 690 700 710 

X XX X X.X X XX X 

CTG TCI GCG GGA GCC CAT TTC ATC ATC ACG CAG CTT TTC TTT GAG GCT GAC 

CAC ACA CCC CCT CCC CIA AAG TAG TAC TGC GTC GAA AAG AAA CTC CCA CTC 

Val Set Ala Gly Ala Asp Plie He He Tti r Gin l eu Plie Ptie Glu Ala Asp» 
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1 2 0 7 3 0 7 U 

XX X XX X 

ACA TIC lie fOC ITT GIG AAG G[A I6C 
TGI AAO AAG GCG AAA [At TIC CGI A[G 
Ihr Phe Phe Arg Phe Val Lys Ala Cys 



ISO 760 

XX X XX 

A[[ GA[ ATG GGt AT[ ACT TGC CCC 

IGG CTG lAC CCG TAG IGA ACG GGG 

Ihr Asp Het Gly lie Ihr [ys Pro' 



770 180 790 800 810 

XXX X XXX XXX 

AIC GK CCC GGG Alt ITI ttt AH tAG GGC lAt tAC Ttt fll CGG CAG til 

TAG CAG GGG CCC TAG AAA GGG TAG GIC CCG ATG GIG AGG 6AA GCt GTC GAA 

He Val Pro Gly He Phe Pro He Gin Gly lyr His Ser leu Arg Gin Leu» 



870 830 8(0 8S0 860 

XX X XX X XX X X 

GIG AAG tic ICC AAG CIG GAG GIG CCA CAG GAG AIC AAG GAC GIG AIT GAG 

CAC TIC GAC AGG TIC GAC CIC CAC GGT GTC CTC TAG TIC CTG CAC TAA CTC 

Val Lys leu Ser lys leu Giu Val Pro Gin Glu He Lys Asp Val He Glu> 



870 880 890 

X X X X X 

CCA AIC AAA GAC AAC GAI GCT GCC 

GGT TAG HI CIG TIG CIA CGA CGG 

Pro He lys Asp Asn Asp Ala Ala 



900 910 

XX X XX 

ATC CGC AAC TAT GGC AIC GAG CTG GCC 

1AG GCG TIG ATA CCG TAG CTC GAC CGG 

He Arg Asn lyr Gly He Glu leu Ala> 



920 9 3 0 9U 

X X X X X 

GIG A6C CTG TGC [AG GAG CTI CIG 

CAC ICG GAC ACG GTC CTC GAA GAC 

Val Ser leu Cys Gin Glu leu leu 



950 960 

X X X X X 

GCC AGI GGC TIG GIG CCA GGC CTC CAC 

CGG TCA CCG AAC CA( GGT CCG GAG GTG 

Ala Ser Gly leu Val Pro Gly leu His> 



970 980 990 1000 1010 1070 

XX X XX X XX X XX 

TIC TAC ACC CIC AAC CGC GAG AIG GCT ACC ACA GAG GIG CTG AAG [G[ [TG 

AAG ATG IGG GAG TIG GCG (TC TAC CGA IGG TGT CTC CAC GAC TIC GCG GAC 

Phe lyr Ihr leu Asn Arg Glu Het Ala Ihr Ihr Glu Val leu Lys Arg leu> 

1030 1040 10S0 1060 1070 

X «X X X X X XX X 

GGG ATG IGG ACT GAG GAC CCC AGG CGI CCC CIA CCC IGG GCT CIC AGI GCC 

CCC TAC ACC IGA CTC CIG GGG ICC GCA GGG GAI GGG ACC CGA GAG TCA CGG 

Gly Met Irp Ihr Glu Asp Pro Arg Arg Pro leu Pro Irp Ala leu Ser Ala. 
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1080 1090 1100 1110 1170 

XX XXX XXX XX 

CAC CC( AAG CGC CGA GAG GAA GAT GIA CGT CCC ATC TTC TG6 GCC T(( AGA 
GTG 6GG TT[ GCG GCT CU CTT (TA CAl GCA 6GG TAG AAG ACC CGG AGG TtT 
His Pro lys Arg Arg Glu Glu Asp Yal Arg Pro He Phe Trp Ala Ser Arg> 



1130 1U0 

X X X X X 

CCA AAG AGT TAG ATC TAC CGT ACC 

GGT TTC TCA ATG TAG ATG GCA TGG 

Pro Lys Ser Tyr He Tyr Arg Thr 



11S0 1160 1110 

XX X XX 

CAG GAG TGG GAC GAG TTC CCT AAC GGC 

GTC CTC ACC CTG CTC AAG GGA TTG [CG 

Gin Glu Trp Asp Giu Phe Pro Asn Gl|f> 



1180 1190 1700 

X XX X XX 

CGC TGG GGC AAT TCC TCT TCC CCT GCC 

GCG ACC CCG TTA AGG AGA AGG GGA CGG 

Arg Trp Gly Asn Ser Ser Ser Pro Ala 

1230 1240 1250 

X X X XXX 

CTC TTC TAC CTG AAG AGC AAG TCC CCC 

GAG AAG ATG GAC TTC TCO TTC AGG GGG 

Leu Phe Tyr Leu Lys Ser Lys Ser Pro 



1210 1220 

X ' X X X 

TTT GGG GAG CTG AAG GAC TAC TAC 

AAA CCC CTC GAC TTC CTG ATG ATG 

Phe Gly Glu Leu Lys Asp Tyr Tyr. 



1260 1270 

XX X XX 

AAG GAG GAG CTG CTG AAG ATG TGG 

TTC CTC CTC GAC GAC TTC TAC ACC 

Lys Glu Glu Leu Leu Lys Het Trp> 



1280 1290 1300 

X X X X X 

GGG GAG GAG CTG ACC AGT GAA GCA AGT 

CCC CTC CTC GAC TGG TCA CTT CGT TCA 

Gly Glu Glu Leu Thr Ser Glu Ala Ser 



1310 1320 

X X X K X 

GTC TTT GAA GTC TTT GTT CTT TAC 
CAG AAA CTT CAG AAA CAA GAA ATG 
Val Phe Glu Val Phe Val Leu Tyr> 



1330 1340 1350 

XX X XX 

CTC TCG GGA GAA CCA AAC CGG AAT GGT 

GAG AGC CCT CTT GGT TTG GCC TTA CCA 

Leu Ser Gly Glu Pro Asn Arg Asn Gly 



1360 1310 

X X X X X 

CAC AAA GTG ACT TGC CTG CCC TGG 
GTG TTT CAC TGA ACG GAC GGG ACC 
His Lys Val Thr Cys Leu Pro Trp. 



1 380 1 390 UOO 

X I X > > 

AAC GAT GAG CCC CTG GCG GCT GAG ACC 

TTG CTA CTC GGG GAC CGC CGA CTC TGG 

Asn Asp Glu Pro Leu Ala Ala Glu Thr 



U10 U20 

XX X XX 

AGC CTG CTG AAG GAG GAG CTG CTG 
TCG GAC GAC TTC CTC CTC GAC GAC 
Ser Leu Leu Lys Glu Glu Leu Leu. 

^^=r ITl 
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U30 1U0 USO U60 U70 

[6G GT6 AAC CGC [AG GGC AT[ CTC A[C ATC AAC TCA [AG t[C AA[ AT( AAC 

G[[ [AC TTG GCG GT[ [[& TAG GAG TGG TAG TTG AGT GK GGG TTG TAG TIG 

Arq Val Asn Arg Gin Gly He leu Thr He Asn Ser Gin Pro Asn He Asn> 



U8fl U90 ISflO 

XX X XX 

GGG AAG (CG TCC Kt GA( C(( ATC 

CCC TTC GGC AGG AGG CTG GGG TAG 

Gly Lys Pro Ser Ser Asp Pro He 



1S10 1520 1530 

X XX X XX 

GTG GGC TGG GGC CCC AGC GGG GGC TAT 
CAC CCG ACC CCG GGG TC6 CCC CCG ATA 
Val Gly Trp Gly Pro Ser Gly Gly Tyr. 



1540 1550 

X X X X 

GTC TTC CAG AAG GCC TAC TTA GAG 

CAG AAG GTC TTC CGG ATG AAT CTC 

Val Phe Gin lys Ala Tyr Leu Glu 



1560 1570 1580 

XX X XX X 

TTT TTC ACT TCC CGC CAG ACA GCG GAA 

AAA AAG TGA AGG GCG CTC TGT CGC CTT 

Pile Phe Thr Ser Arg Glu Thr Ala Glu» 



1590 1600 

XX X XX 

GCA CTT CTG CAA GTG CTG AAG AAG 

CGT GAA GAC GTT CAC GAC TTC TTC 

Ala leu Leu Gin Val Leu Lys Lys 



1610 1620 1630 

X X X X X 

TAC GAG CTC CGG GTT AAT TAC CAC CTT 
ATG CTC GAG GCC CAA TTA ATG CTG GAA 
Tyr Glu Leu Arg Val Asn Tyr His leu» 



16U 1650 

> > > < ( 

GTC AAT GTG AAG GGT CAA AAC ATC 

CAG TTA CAC TTC CCA CTT TTG TAG 

Val Asn Val Lys Gly Glu Asn He 



1660 1670 1680 

XX X XX 

ACC AAT GCC CCT GAA CTG CAG CCG AAT 
TGG TTA CGG GGA CTT GAC GTC GGC TTA 
Thr Asn Ala Pro Glu Leu Gin Pro Asn> 



1690 1700 1710 1720 1730 

CCT GTC ACT TGG GGC ATC TTC CCT GGG CGA GAG ATC ATC CAG CCC ACC GTA 

CGA CAG TGA ACC CCG TAG AAG GGA CCC GCT CTC TAG TAG GTC GGG TGG CAT 

Ala Val Thr Trp Gly He Phe Pro Gly Arq Glu He He Gin Pro Thr Val. 



1740 



1750 



1760 



1770 



1780 



GTG GAT CCC GTC AGC TTC ATG TTC TGG AAG GAC GAG GCC TTT GCC CTG TGG 
CAC CTA GGC CAG TCC AAG TAC AAG ACC TTC CTG CTC CCG AAA CGG GAC ACC 
Val Asp Pro Val Ser Phe Hef Phe Trp Lys Asp Glu Ala Phe Ala leu Trp. 
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1790 1800 1810 1820 1830 

ATT GAG CGG TGG GGA AAG CT6 TAT GAO GAG GAG TCC [[G TCC tG[ ACC ATC 

TAA CTC Gtt ACC CCT TTC GAC ATA CTC CTC CTC AGG GGC A6G GCG TGG TAG 

He Glu Arg Trp Gly lys Leu lyr GIu Glu Glu Ser Pro Ser Arg Thr Ile> 



1840 1BS0 1860 1870 1880 

ATC CAG TAC ATC CAC GAC AAC TAC TTC CTC GTC AAC CTG GTG GAC AAI GAC 
TAG GTC ATG TAG GTG CTG TTG ATO AAG GAC CAG TTG GAC CAC CTG TTA CTC 
He Gin Tyr He His Asp Asn Tyr Phe Leu Val Asn Leu Val Asp Asn Asp> 



1890 1900 1910 1920 1930 

TTC CCA CTG GAC AAC TGC CTC TGG CAG GTG GTG 6AA GAC ACA TTG GAG CTT 

AAG G6T CAC CTG TTG ACG GAG ACC GTC CAC CAC CTT CTG TGT AAC CTC GAA 

Phe Pro Leu Asp Asn Cys Leu Trp Gin Vai Val Glu Asp Thr Leu Glu Leu> 



1940 1950 1960 1970 1980 1990 

« IX X XX X xxxx 

CTC AAC AGG CCC ACC CAG AAT GCG A6A GAA ACG CAG CCT CCA I6ACCCTGCG 

GAG TTG TCC GGG TGG GTC TTA CCC TCT CTT TGC CTC CGA GGT ACTGGCACGC 

Leu Asn Arg Pro Thr Gin Asn Ala Arg Glu Thr Glu Ala Pro> 

2000 2010 2020 2030 2040 2050 

XX XS XX XX XX XX 

TCCTCACGCC CTGCGTTGGA GCCACTCCTG TCCCGCCTTC CTCCTCCACA GTGCT6CTTC 

AG6ACTGCGG GACGCAACCT CGGTGAGGAC AGGGCGGAAG GAGGAGGTGT CAC6ACCAAG 



2060 2070 2080 2090 2100 2110 

XX XX XX xx xx XX 

ICTT06GAAC TCCACTdtC TTCGTGTCTC TCCCACCCCG G(CT[[ACTC CCCCACCIGA 

AGAACtCTTG AGGTGAGAGG AAGCACAGAG AGGGTGGGGC CGGAGGTGAG GGGGTGGACT 



2 1 20 2 1 30 2 U 0 2 1 5 0 2 1 60 2 1 7 0 

XX XX XI XX XX XX 

CAATGG(AGC TAGAtTGOAG TGAOGCIICC AGGCTCTICC TGGACCTGAG TCGGtCCCAC 

GTIACCGUG ATCTGACCIC ACICCGAAGG KCGAGAAGG ACCTGGACTC AGCCGGGGTG 



2180 2190 2200 2210 2220 

XX XX XX XX XX 

ATGQGAACCI AGTACTCTCT GCTCTAAAAA AAAAAAAAAA AAAGGAATTC 

TACCCITGGA KATGAGAGA (GAGATTTTT HITTTTTTT TTTCCTTAA6 
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ARG 

9C 



f 


TER 




GATC 


*A 


GATC 








CC 






T 
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ARG 



G AT C 



OLN 



G A T C 
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♦AC 
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ALA VAL 

GATC GATC 
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AAI UC OGA 


I X 

GCC AIG {IG AAC 


X 

GAA GCC AGA GGA AAC AGC 


X 

AGC 


CIC 


X 

AAC CCC IGC 


X 

IIG GAG 


60 




Kel Val Asn 


Glu Ala Arg Gly Asn Ser 


Ser 


Leu Asa Pro Cys 


Leu 


Giu 


n 


GG( AG! G(( 


1 X 

AG( AOI G0( AGI 


X 

GAG AGC KC AAA GAI AG! 


X 

KG 


AGA 


IGI ICC ACC 


X 

CCG GGC 


m 


Gl]f Ser Ala 


Ser Ser Gly Ser 


61u Ser Ser Lys Asp Ser 


Ser 


Arg Cys Ser Ihr 


Pro Gly 


36 


CTG GAC CCr 


X X 

GAG [GG [AI GAG 


X 

AGA CIC CGG GAG AAG AIG 


X 

AGG 


CGG 


CGA TIG GAA 

VUn liu unn 


ICI 


GCI 


180 


le« Asp Pro 


Glu Arg His Glu 


Arg leu Arg Glu lys Hei 


Arg 


Arg 


Arg leu Glu 


Ser 


Gly 


56 


GAC AAG TGG 


X X 

IH K[ CIG GAA 


X 

nC lie CCI CCl CGA AC? 


X 

GCI 




X 

fifiA fifT fiTf 

UUn Ul 1 U 1 I 


AAT 


X 

r Tf 


2U 


Asp Lys Irp 


Phe Ser Leu Glu 


Phe Phe Pro Pro Arg Ihr 


Ala 


Glu 


Gly Ala Val 


Asn 


Leu 


16 


AT( UA AGG 


X X 

in GAC CGG AIG 


X 

GCA GCA GGI GGC CCC CIC 


X 

lAC 


ATA 

A 1 ft 


X 

f.Af r.Tfi Arr 

Ufll UIU ftll 


Tfifi 
1 uu 


X 

f Af 
I Al 


300 


He Ser Arg 


Phe Asp Arg Het 


Ala Ala Gly Gly Pro Leu 


lyr 


He 


Asp Val Ihr 


Irp 


His 


96 


([A GCA GGl 


X X 

GAC [[I GGC KA 


X 

GAC AAG GAG ACC ICC HC 


X 

AIG 


A 1 U 


X 

ATf r»fr AT.r 
All ull Aul 


If r 


X 

rr f 
uU 


360 


Pro Ah Gly 


Asp Pro Giy Ser 


Asp Lys Glu Ihr Ser Ser 


Hel 


Het 


He Ala Ser 


Ihr 


Ah 


116 


GIG AAC lAC 


X X 

IGI GGC CTG GAG 


X 

ACC AU (IG CAC AIG ACC 


X 

IGC 




X 

CGI CAC CGC 


CIC 


X 

GAG 


WO 


Val Asn lyr 


Cys Gly Leu Gtu 


Ihr He Leu His Hel Ihr 


Cys 


Cys 


Arg Gin Arg 


Leu 


Glu 


136 


GAG AIC ACG 


X X 

GGC CAI CIG CAC 


X 

AAA GCI AAG CAG CIG GGC 


X 

CIG 


AAG 


X 

AAC AIC AIG 


GCG 


X 

CIG 


(BO 


G!i/ He Ihr 


Gly His Leu His 


Lys Ala Lys Gin Leu Oly 


Leu 


Lys 


Asn He Het 


Ala 


Leu 


156 


(GG GGA GA[ 


X X 

CCA AIA GGI GAC 


X 

CAG IGG GAA GAG GAG GAG 


X 

GGA 


GGC 


X 

lie AAC lAC 


GCA 


X 

GIG 


sto 


Arg Gly Asp 


Pro He Gly Asp 


Gin Irp Giu Glu Glu Glu 


Gly 


Gly 


Phe Asn lyr 


Ala 


Val 


116 


GAC CIG GIG 


X X 

AAG [AC AIC CGA 


X 

AGI GAG III GGI GAC lAC 


X 

III 


GAC 


X 

AIC IGI GIG 


GCA 


X 

GGI 


600 


Asp Leu Val 


Lys His lie Arg 


Ser Glu Phe Gly Asp lyr 


Phe 


Asp 


He Cys Val 


Ala 


Gly 

X 


196 


TAC CCC AAA 


X X 

GGC CAC CCC GAA 


X 

GCA GGG AGC III GAG GCI 


X 

GAC 


CIG 


AAG CAC IIG 


AAG 


GAG 


660 


lyr Pro Lys 


Gly His Pro Glu 


Ala Gly Ser Phe Glu Ala 


Asp 


leu 


Lys His Leu 

X 


Lys 


Glu 

X 


216 


AAG GIG KI 


X X 

GCG GOA GCC &AI 


X ■ 

lie AIC AIC ACG CAG CII 


X 

IK 


III 


GAG GCI GAC 


ACA 


IK 


720 


Lys Val Ser 


Ala Gly Ala Asp 


Phe He He Ihr Gin Leu 


Phe 


Phe 


Glu Ala Asp 


Ihr 


Phe 


236 








— y 
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TK [G[ TIT GIG AA6 GCA TGC A(C GA( AIG GGt AI( Ad TGC (C( AT[ GT[ [(( GGG ATC 780 
Pbe Arg Phe Val lys Ala (ys Ihr Asp Het Gly He Thr [ys Pro He Val Pro Gly He 2S( 

III I K X 

ITI [[[ ATC CAG GGC lAC (A( T(C (IT (6G (AG CTI GIG AAG (TG I(( AAG (TG GAG GIG 8U 

Phe Pro He Gin Gly Tyr His Ser leu Arg Gin Leu Val Lys Leu Ser lys Leu Glu Val 276 

III III 

((A (AG GAG AH AAG GA( GIG ATT GAG (CA AT( AAA GA( AA( GAI G(I G(( AI( (G( AA( 900 

Pro Gin Glu He lys Asp Val He Glu Pro He Lys Asp Asn Asp Ala Ala He Arg Asn 296 

III III 

lA? GGC AH GAG CTG GtC GIG AGC tIG TGt (AG GAG (TT (TG C(( AGT GG( IIG GIG ((A 960 

Tyr Gly Hp Glu Leu Ala Val Ser Leu (ys Gin Glu leu leu Ala Set Gly leu Val Pro 316 

I I 1 I X X 

GG( (T( (A( TK IA( A(( (T( AA[ (G( GAG ATG G(I A[( A(A GAG GIG (TG AAG (G( (16 1020 
Gly leu His Phe Tyr Ihr leu Asn Arg Glu Hel Ala Ihr Thr Glu Val Leu Lys Arg Leu 336 

XII I X X 

GGG AIG IGG A(T GAG GA( ((( AG6 (GT ([( (lA ((( I6G G(I (I( AGT G(( CA( ([( AAG lOBO 
Gly Hel Irp Ihr Glu Asp Pro Arg Arg Pro Leu Pro Trp Ala Leo Ser Ala His Pro lys 3S6 

I I X XXI 

(GC (GA GAG GAA GAI GIA (GI ((( AT( IK IGG G[( K( AGA ((A AAG AGT IA( AK IA( lUO 

Arg Arg Glu Glu Asp Val Arg Pro He Phe Trp Ala Ser Arg Pro Lys Ser lyr He Tyr 316 

III III 

(GT A[( (AG GAG TG6 GAC GAG IK ((T AA( GG( (G( IGG GGt AAT I(( Id I(( ((I Gt( 1200 

Arg Thr Gin Glu Trp Asp Glu Phe Pro Asn Gly Arg Trp Gly Asn Ser Ser Ser Pro Ala 396 

I X X I X X 

TIT GGG GAG (TG AAG GA( TA( TA( (K IK IA( (TG AAG AGC AAG KC ((( AAG GAG GAG 1260 
Phe Gly Glu leu lys Asp lyr Tyr leu Phe Tyr leu Lys Ser lys Ser Pro lys Glu Glu 416 

I X X I X X 

(TG (IG AAG ATG IGG GGG GAG GAG (IC ACC AGT GAA G(A ACT GK III GAA CK TTI GTT 1320 
leu Leu lys Het Irp Gly Glu Glu leu Ihr Ser Glu Ala Ser Val Phe Olu Val Phe Val 436 

I I X I X X 

(TT TA( (K KG GGA GAA ((A AAC C6G AAT GGT CAt AAA GIG ACT TGC CTG CCC IGG AAC 1380 
leu Tyr leu Ser Gly Glu Pro Asn Arg Asn Gly His lys Val Thr Cys leu Pro Irp Asn 456 

III 1 X X 

GAI GAG CCC CTG GCG GCI GAG ACC AGC CTG CTG AAG GAG GAG CTG CIG CGG GIG AAC CGC 1440 
Asp Glu Pro leu Ala Ala Glu Thr Ser leu leu lys Glu Glu leu Leu Arg Val Asn Arg 415 



BNSDOCID: <WO 9533054A1 J_> 



SUBSTITUTE SHEET 



wo 95/33054 



PCT/CA95/00314 



13/18 



X X 

[A6 GGC AU CU ACC AK AAC UA (AG 
Gin Gly lie Leu Ihr lie Asn Ser Gin 


X 

CCC 
Pro 


AAC 
Asn 


AIC 
lie 


X 

AA( GGG 
Asn Gl; 


X 

AAG ((G I([ 
Lys Pro Ser 


ICC GAC 
Ser Asp 


X 

CCC 
Pro 


1S00 
U6 


X X 

AK GIG GGC IGG GGC CCC AGC GGG GGC 
He Ul Gly Irp Gly Pro Ser Gly Gly 


X 

lAI 
lyr 


GIC 
Val 


lie 

Phe 
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